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(57) ABSTRACT 

A videoconferencing application suitable for large confer- 
ences with many terminals where displayed speakers are 
changed dynamically based on some schema employs ani- 
mation effects to indicate the change. When a new speaker 
replaces an old, the new speaker's image is gradually 
transitioned into the place of the old speaker's by an 
animation that provides a clear cue and a visual metaphor for 
the event. A preferred embodiment translates the new image 
in and the old one out as if the new one were pushing the old 
image out of its place. Various other animations are also 
described which provide a similar result. 

7 Claims, 6 Drawing Sheets 
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FIG. 1 
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METHOD AND SYSTEM FOR INDICATING 
CHANGE OF SPEAKER IN A 
VIDEOCONFERENCE APPLICATION 



FIELD OF THE INVENTION 

The invention relates to communication systems employ- 
ing at least two interconnected terminals supporting video- 
conferencing processes in which multiple parties commu- 
nicate continuously and in which, at any given time, only 
one person is normally speaking. The invention further 
relates to such an environment where the terminals display 
images of at least one of the most recent speakers and which 
indicate a change in a most recent one of the speakers. 

BACKGROUND OF THE INVENTION 

Various multi-user communication systems, for example, 
groupware applications, distance learning systems, white- 
board applications, and videoconferencing systems permit a 
group, sometimes very large groups, to collaborate in real 
time. Distance learning systems are used for enabling stu- 
dents and teachers to attend and give lectures without the 
requirement of physically being in the same room or even 
the same building. Probably most widespread are videocon- 
ferencing systems. They allow people to have meetings 
without being restricted by the conference location. An 
example of such a conference system is described in U.S. 
Pat. No. 5,675,374, the entirety of which is incorporated by 
reference as if fully set forth herein. This system aims at 
establishing a videoconference which closely resembles a 
face-to-face conference held at a single location. The fol- 
lowing invention may be used in this In such systems, each 
participant, or subgroup of participants, may have a terminal 
with a camera and at least one display. Communication can 
be oral or by typing, augmented by other mechanisms such 
as document scanning, handwriting recognition or direct 
image transfer, etc. One of the most important elements of 
such systems, however, is the visage of active participants. 

To allow users to see each other, camera images are 
selectively displayed on each display. A typical display may 
show the field of view one or more cameras. If the confer- 
ence includes many participants, the display of all the 
participants at all times becomes impossible so such systems 
select some subset of the participants based on criteria. One 
of the criteria typically used to identify the most worthy of 
the group to display is how recently the speaker has spoken. 
For example, the displays may show the last three speakers 
that spoke. When one of these has been dormant for a while, 
his/her image will be replaced by another who has spoken 
more recently. 

The ongoing maintenance of the select group of partici- 
pants results in sudden changes in the faces of the individual 
participants that spoke most recently. There are at least two 
problems. First, the sudden change from one face to another 
may not be very conspicuous so it might not be noticed. 
Second,. the transition can be abrupt and may therefore be 
disconcerting. An improved system for making the changes 
in the displayed subgroup more clear and natural is desir- 
able. 

SUMMARY OF THE INVENTION 

Briefly, a typical display used in videoconferencing envi- 
ronment shows a subset of a larger group of participants, the 
subset being characterized by some selection criterion. For 
example, the three that have spoken the most up until the 
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present moment may be maintained on the screen. 
Alternatively, the three that have spoken most recently may 
be maintained on the screen. Of course, the subset can be 
any practicable size and the number three is simply an 

5 example. When a change in the subset is required, one 
speaker is replaced by another in the following fashion. An 
animation is instantiated in which the image of the speaker 
that is to be replaced is scrolled off the screen and the new 
one is scrolled on as if pushing the old image off the screen. 

10 Various other alternative animations can be employed, the 
common theme of them being that there is a gradual tran- 
sition and the transition is conspicuous. 

To insure the animation is prominent and noticeable, a 
fade would not be ideal, even though such an effect is 

15 gradual and not terribly disconcerting. Another example of 
a suitable effect is a peel-away effect where the old image 
curls up from one edge to reveal the new image below it. 
Still another is a fade into the distance followed by a 
foreshortening effect or slide-in effect. The basic require - 

20 ment is that the effect result in a gradual replacement of the 
image of the old. speaker by a new speaker but in such a 
fashion as to be noticeable but visually agreeable. 

A wide range of video animation effects can be produced 
using digital technology. These include shape changes such 

25 as gradually wrapping a planar image onto a cylinder or 
sphere, paper peel away effect, image breaking up into small 
bits and scattering, etc. Such effects are often used in 
football broadcasts. The idea in the invention is not to dazzle 
but to provide a visual cue and metaphor for the change of 

30 a member of group, like someone coming in a room and 
another leaving. 

BRIEF DESCRIPTIONS OF THE DRAWING 

FIG. 1 is an illustration of hardware components that may 
35 be used to implement embodiments of the invention. 

FIG. 2 is a figurative representation of a first participant- 
image changeover animation provided by an embodiment of 
the invention. 

40 FIG. 3 is a figurative representation of a second 
participant-image changeover animation provided by 
another embodiment of the invention. 

FIG. 4 is a figurative representation of a third participant - 
image changeover animation provided by yet another. 

45 embodiment of the invention. 

FIG. 5 is a figurative representation of a fourth 
participant-image changeover animation provided by yet 
another embodiment of the invention. . 

FIG. 6 is a figurative representation of a fifth participant- 

50 image changeover animation provided by yet another 
embodiment of the invention. This animation illustrates a 
candidate speaker appearing momentarily as he begins 
speaking. As the speech continues, the new image grows 
until, once the speech reaches a threshold duration, it starts 

55 to displace an old speaker. 

DETAILED DESCRIPTION OF THE DRAWINGS 

Referring now to FIG. 1 a videoconferencing system with 
several network terminals 110, 112 in communication over 

60 a network 115. Each terminal may have a television/monitor 
230 receiving video output from a video stream combiner 
and visual effects generator 240. The latter receives video 
and/or data signals 260/270 from an Internet server, cable 
connection, switched connection, etc. A camera captures 

65 video from the location of the videoconferencing system. 
This video is shared with other like stations through a 
suitable network and gathered in the same way. 
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Referring to FIG. 2 a typical display used for applications narrows, suggesting that it is on a rectangular cylinder 

where many participants are connected through groupware, (prism) that is rotating to the left. As the imaginary cylinder 

or some similar software device, has a small number of continues to rotate, the image of speaker D begins to roll into 

images of participants A, D, and C as shown in frame 15. The view in frames 132 and 133. The effect continues with frames 

images that are ordinarily shown are a subset of the partici- 5 134 and 135 until the image of the new speaker, speaker D 

pants that are selected according to some suitable schema completely displaces that of the old speaker, speaker B. 

such as frequency of participation, title, how recently the Referring now to FIG. 6, a candidate speaker appears on 

participant spoke, etc. Because such criteria may be the display at Dl when the candidate speaks momentarily as 

dynamic, that is, they may change over time, the video snown in frame 141. If the candidate speaker Dl continues 

image of one participant must be replaced with that of 10 to speak, the candidate continues to grow as shown in frame 

another on occasion. To make this transition clear, an 142. If the candidate speaker Dl stops speaking, his/her 

animation sequence is employed which starts with frame II. i^ge shrinks and disappears. If the candidate speaker 

In frame II, a new speaker's image D has appeared on the continues speaking beyond a predefined interval, the candi- 

screen. The new speaker's image D moves into the space as date speaker image Dl grows to the size of the other 

the old speaker's image B is moved out giving the appear- 15 speakers A, B, and C and the animation sequence of FIG. 2, 

ance of being pushed out by the new speaker's image D. In beginning at frame 144 is performed. Thus, the candidate 

an alternative embodiment, the new speaker's image D now pushes the old speaker B out of the frame as illustrated 

could move upwardly from the bottom and displace the old in frames 144-146. 

speaker's image B upwardly in a similar fashion with a while the particular embodiment shown and described 
similar effect. This animated transition provides a visual 20 above nag proven t0 ^ [ n determining the change in 
metaphor for the action that is taking place, analogous to a speaker for purposes of video conferencing, further modi- 
person leaving a group by moving out as another comes in. fications of the present invention herein disclosed will occur 
The visual effect generation required to perform the above to persons skilled in the art to which the present invention 
animation sequence provides no problem as this technology pertains, and all such modifications are deemed to be within 
has been around for some time and is used with some 25 the scope and spirit of the present invention defined by the 
frequency in the television broadcast area. However in this appended claims, 
context, its purpose serves more than mere entertainment I claim: 

value. It is a cue that a change has taken place, the fact that 1. A method of indicating a change of speaker in a 

it involves gradual transitions, such as image movement, videoconferencing system, comprising the steps of: 

rather than an abrupt changes, makes it more visually 30 displaying images of current conference participants; 

appealing, the particular effect is a meaningful and imme- displaying an image of a new conference participant 

diately universally recognizable visual metaphor for the responsively to an indication of speech uttered by said 

change that has taken place. The change is that one person new conference participant; 

has left the group and another has come in. Such features are tcmoving gaid . Qf said flew conference partici p an t 

unportant elements of interaction Resign and serve to make wheQ said h ^ of ft ^ dm ^ Qn; 

a piece of equipment more user-friendly and informative. , . , . , . r . , _ 

r ^ r . . replacing one of said images of said current conference 

Referring to FIG. 3, another animation sequence that participants with said image of said new conference 

serves a similar purpose as the one shown in FIG. 2 begins participant when said speech is of a-duration longer 

with frame 111. In frame 111, the three speakers A, B, and C 4Q than ^ first duration< 

are shown prior to the changeover. In frame 112 speaker B's 2. A method as in claim 1, wherein said step of replacing 

image begins to peel upward as a piece of paper being mc i ud es one ofdisplacing and contracting said image of said 

picked up from the bottom and peeled away. The transition OQe of said cunem ntfimncc participants, 

continues through frames 113-115 with the "top" image 3 A melhod ^ in claim ^ wherem said st of replacing 

peehng away completely to reveal the image of speaker B. 45 mdudes di]ating said image of said new mnEmncc partici . 

Referring to FIG. 4, yet another animation sequence that p an t. 

serves a similar purpose as the ones shown in FIGS. 2 and 4. a method as in claim 3, wherein said step of removing 

3 begins with frame 121. In frame 121, the three speakers A, includes contracting said image of said new conference 
B, and C are shown just at the beginning of the animated participant. 

changeover. In frame 121, the first step is shown with the 50 5. A method as in claim 1, wherein said step of replacing 

image of the new speaker, speaker D, appearing small at first includes dilating said image of said new conference partici- 

in a portion of the old speaker's image B, In frame 122 p an t, 

speaker D's image begins to grow. It continues to grow until 6. A method as in claim 5, wherein said step of removing 

it covers the old speaker's image B completely. The result is includes contracting said image of said new conference 

shown in frame 125. 55 participant. 

Referring to FIG. 5, another animation sequence that 7. A method as in claim 1, wherein said step of removing 

serves a similar purpose as the one shown in FIGS. 2, 3, and includes contracting said image of said new conference 

4 begins with frame 131. In frame 131, the three speakers A, participant. 
B, and C are shown prior to the changeover. In frame 132 

speaker B's image begins to move toward the left and ***** 
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